[Frontend] Delegate tokenization serving preprocessing to OpenAIServingRender by sagearc · Pull Request #37266 · vllm-project/vllm

sagearc · 2026-03-17T07:52:55Z

Purpose

OpenAIServingRender (#36166) is the canonical, engine-free home for request preprocessing. #36483 wired it into OpenAIServingChat and OpenAIServingCompletion, but OpenAIServingTokenization was left calling the duplicate copies on OpenAIServing directly. This PR continues that cleanup by delegating tokenization serving preprocessing to OpenAIServingRender, and moves its construction to init_app_state so it's available to all serving classes from the start.

OpenAIServingTokenization now delegates the following methods to openai_serving_render instead of calling the OpenAIServing base class copies:
- _validate_chat_template
- _preprocess_chat
- _preprocess_completion
OpenAIServingRender construction is moved out of init_generate_state into init_app_state so it's available earlier and shared across all serving classes.

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

…ngRender Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

gemini-code-assist

Code Review

This pull request refactors the initialization and usage of the OpenAIServingRender component. The OpenAIServingRender instance is now created centrally in api_server.py's init_app_state function, rather than in api_router.py's init_generate_state. Subsequently, the OpenAIServingTokenization service is updated to accept and utilize this OpenAIServingRender instance, delegating chat template validation and prompt preprocessing methods (_validate_chat_template, _preprocess_chat, _preprocess_completion) to it, thereby improving modularity and separation of concerns.

vllm/entrypoints/serve/tokenize/serving.py

Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Vinay Damodaran <vrdn@hey.com>

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: EricccYang <yangyang4991@gmail.com>

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

9534108

…ngRender Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

sagearc requested review from DarkLight1337, aarnphm, chaunceyjiang, njhill and russellb as code owners March 17, 2026 07:52

mergify bot added the frontend label Mar 17, 2026

gemini-code-assist bot reviewed Mar 17, 2026

View reviewed changes

DarkLight1337 reviewed Mar 17, 2026

View reviewed changes

vllm/entrypoints/serve/tokenize/serving.py Outdated Show resolved Hide resolved

cr fix

f538d00

Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

sagearc requested review from NickLucche and robertgshaw2-redhat as code owners March 17, 2026 08:26

DarkLight1337 approved these changes Mar 17, 2026

View reviewed changes

DarkLight1337 enabled auto-merge (squash) March 17, 2026 08:27

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 17, 2026

DarkLight1337 merged commit 00f8e0d into vllm-project:main Mar 17, 2026
48 checks passed

sagearc deleted the delegate-openai-tokenization-to-renderer branch March 17, 2026 11:23

Lucaskabela pushed a commit to Lucaskabela/vllm that referenced this pull request Mar 17, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

34eee12

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

andylolu2 pushed a commit to andylolu2/vllm that referenced this pull request Mar 18, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

d03f969

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

c14e4fb

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

fxdawnn pushed a commit to fxdawnn/vllm that referenced this pull request Mar 19, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

e3f9625

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

f15b075

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

JiantaoXu pushed a commit to JiantaoXu/vllm that referenced this pull request Mar 28, 2026

[Frontend] Delegate tokenization serving preprocessing to OpenAIServi…

0bb041d

…ngRender (vllm-project#37266) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Frontend] Delegate tokenization serving preprocessing to OpenAIServingRender#37266

[Frontend] Delegate tokenization serving preprocessing to OpenAIServingRender#37266
DarkLight1337 merged 2 commits intovllm-project:mainfrom
sagearc:delegate-openai-tokenization-to-renderer

sagearc commented Mar 17, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

sagearc commented Mar 17, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sagearc commented Mar 17, 2026 •

edited by github-actions bot

Loading